AITopics | Sydney

Sydney

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Zhong, Lucen, Du, Zhengxiao, Zhang, Xiaohan, Hu, Haiyi, Tang, Jie

arXiv.org Artificial Intelligence, Jan-17-2025

Enhancing large language models (LLMs) with real-time APIs can help generate more accurate and up-to-date responses. However, evaluating the function calling abilities of LLMs in real-world scenarios remains under-explored due to the complexity of data collection and evaluation. In this work, we introduce ComplexFuncBench, a benchmark for complex function calling across five real-world scenarios. Compared to existing benchmarks, ComplexFuncBench encompasses multi-step and constrained function calling, which requires long-parameter filling, parameter value reasoning, and 128k long context. Additionally, we propose an automatic framework, ComplexEval, for quantitatively evaluating complex function calling tasks. Through comprehensive experiments, we demonstrate the deficiencies of state-of-the-art LLMs in function calling and suggest future directions for optimizing these capabilities. The data and code are available at https://github.com/THUDM/ComplexFuncBench.

large language model, machine learning, natural language, (19 more...)

2501.10132

Country:

North America > United States > New York (0.05)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(6 more...)

Genre: Workflow (0.68)

Industry:

Transportation > Infrastructure & Services > Airport (1.00)
Transportation > Air (1.00)
Consumer Products & Services (0.94)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Wang, Boshi, Fang, Hao, Eisner, Jason, Van Durme, Benjamin, Su, Yu

arXiv.org Artificial Intelligence, Mar-7-2024

Tools are essential for large language models (LLMs) to acquire up-to-date information and take consequential actions in external environments. Existing work on tool-augmented LLMs primarily focuses on the broad coverage of tools and the flexibility of adding new tools. However, a critical aspect that has surprisingly been understudied is simply how accurately an LLM uses tools for which it has been trained. We find that existing LLMs, including GPT-4 and open-source LLMs specifically fine-tuned for tool use, only reach a correctness rate in the range of 30% to 60%, far from reliable use in practice. We propose a biologically inspired method for tool-augmented LLMs, simulated trial and error (STE), that orchestrates three key mechanisms for successful tool use behaviors in the biological system: trial and error, imagination, and memory. Specifically, STE leverages an LLM's 'imagination' to simulate plausible scenarios for using a tool, after which the LLM interacts with the tool to learn from its execution feedback. Both short-term and long-term memory are employed to improve the depth and breadth of the exploration, respectively. Comprehensive experiments on ToolBench show that STE substantially improves tool learning for LLMs under both in-context learning and fine-tuning settings, bringing a boost of 46.7% to Mistral-Instruct-7B and enabling it to outperform GPT-4. We also show effective continual learning of tools via a simple experience replay strategy.

api, query, weather, (13 more...)

2403.04746

Country:

North America > United States > New York (0.06)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.06)
North America > United States > California > San Francisco County > San Francisco (0.05)
(25 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports > Soccer (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

The Perspective of Software Professionals on Algorithmic Racism

Santos, Ronnie de Souza, de Lima, Luiz Fernando, Magalhaes, Cleyton

arXiv.org Artificial Intelligence, Jun-26-2023

Context. Algorithmic racism is the term used to describe the behavior of technological solutions that constrain users based on their ethnicity. Lately, various data-driven software systems have been reported to discriminate against Black people, either because of biased data sets or because of prejudice propagated by software professionals in their code. As a result, Black people are experiencing disadvantages in accessing technology-based services, such as housing, banking, and law enforcement. Goal. This study aims to explore algorithmic racism from the perspective of software professionals. Method. A survey questionnaire was administered to explore software practitioners' understanding of algorithmic racism, and data analysis was conducted using descriptive statistics and coding techniques. Results. We obtained answers from a sample of 73 software professionals discussing their understanding and perspectives on algorithmic racism in software development. Our results demonstrate that the effects of algorithmic racism are well-known among practitioners. However, there is no consensus on how the problem can be effectively addressed in software engineering. In this paper, some solutions to the problem are proposed based on the professionals' narratives. Conclusion. Combining technical and social strategies, including training on structural racism for software professionals, is the most promising way to address the algorithmic racism problem and its effects on the software solutions delivered to our society.

artificial intelligence, machine learning, racism, (17 more...)

2306.15133

Country:

North America > United States > California (0.14)
South America > Brazil > Pernambuco > Recife (0.04)
South America > Brazil > Ceará (0.04)
(5 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Industry: Law > Civil Rights & Constitutional Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Software (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Inverse design of nano-photonic wavelength demultiplexer with a deep neural network approach

Yuan, Mengwei, Yang, Gang, Song, Shijie, Zhou, Luping, Minasian, Robert, Yi, Xiaoke

arXiv.org Artificial Intelligence, May-15-2022

In this paper, we propose a pre-trained-combined neural network (PTCN) as a comprehensive solution to the inverse design of an integrated photonic circuit. By utilizing both the initially pre-trained inverse and forward model with a joint training process, our PTCN model shows remarkable tolerance to the quantity and quality of the training data. As a proof-of-concept demonstration, the inverse design of a wavelength demultiplexer is used to verify the effectiveness of the PTCN model. The correlation coefficient of the prediction by the presented PTCN model remains greater than 0.974 even when the size of training data is decreased to 17%. The experimental results show a good agreement with predictions, and demonstrate a wavelength demultiplexer with an ultra-compact footprint, a high transmission efficiency with a transmission loss of -2 dB, a low reflection of -10 dB, and low crosstalk around -7 dB simultaneously.

inverse design, ptcn model, wavelength demultiplexer, (13 more...)

doi: 10.1364/OE.462038

2206.07114

Country:

Oceania > Australia (0.14)
North America > Canada > Nova Scotia > Cape Breton County > Sydney (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Eliminating the Weakest Link: Making Manipulation Intractable?

Davies, Jessica (University of Toronto) | Narodytska, Nina (NICTA and University of New South Wales) | Walsh, Toby (NICTA and University of New South Wales)

AAAI Conferences, Jul-21-2012

Successive elimination of candidates is often a route to making manipulation intractable to compute. We prove that eliminating candidates does not necessarily increase the computational complexity of manipulation. However, for many voting rules used in practice, the computational complexity increases. For example, it is already known that it is NP-hard to compute how a single voter can manipulate the result of single transferable voting (the elimination version of plurality voting). We show here that it is NP-hard to compute how a single voter can manipulate the result of the elimination version of veto voting, of the closely related Coombs’ rule, and of the elimination versions of a general class of scoring rules.
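To make the elimination rules named in this abstract concrete, here is a minimal Python sketch of how the winner is computed (not how a manipulation is computed, which is the hard problem the paper studies). It assumes every ballot ranks all candidates and that ties are broken alphabetically; both are assumptions of this sketch, not details taken from the paper. The names `run_elimination`, `plurality_loser`, and `veto_loser` are hypothetical.

```python
from collections import Counter

def run_elimination(ballots, round_loser):
    """Repeatedly remove one candidate per round until a single winner remains."""
    remaining = {c for ballot in ballots for c in ballot}
    while len(remaining) > 1:
        # Restrict every ballot to the candidates still in the race.
        restricted = [[c for c in ballot if c in remaining] for ballot in ballots]
        remaining.remove(round_loser(restricted, remaining))
    return next(iter(remaining))

def plurality_loser(restricted, remaining):
    """Elimination plurality: drop the candidate with the fewest first places."""
    firsts = Counter(ballot[0] for ballot in restricted)
    # Assumed tie-break: alphabetically first candidate is eliminated.
    return min(sorted(remaining), key=lambda c: firsts[c])

def veto_loser(restricted, remaining):
    """Elimination veto: drop the candidate with the most last places."""
    lasts = Counter(ballot[-1] for ballot in restricted)
    # Assumed tie-break: alphabetically first candidate is eliminated.
    return max(sorted(remaining), key=lambda c: lasts[c])

ballots = (
    [["a", "b", "c"]] * 5
    + [["c", "b", "a"]] * 4
    + [["b", "c", "a"]] * 2
)
print(run_elimination(ballots, plurality_loser))  # c
print(run_elimination(ballots, veto_loser))       # b
```

On this profile the two rules disagree: plurality elimination removes b and then a, while veto elimination removes a and then c.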

manipulation, manipulator, vote, (15 more...)

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York (0.04)
North America > Canada > Nova Scotia > Cape Breton County > Sydney (0.04)

Industry:

Leisure & Entertainment > Sports (0.68)
Government > Voting & Elections (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Propagating Conjunctions of AllDifferent Constraints

Bessiere, Christian (LIRMM, CNRS) | Katsirelos, George (CRIL-CNRS) | Narodytska, Nina (NICTA and UNSW) | Quimper, Claude-Guy (Universite Laval) | Walsh, Toby (NICTA and UNSW)

AAAI Conferences, Jul-15-2010

We study propagation algorithms for the conjunction of two AllDifferent constraints. Solutions of an AllDifferent constraint can be seen as perfect matchings on the variable/value bipartite graph. Therefore, we investigate the problem of finding simultaneous bipartite matchings. We present an extension of the famous Hall theorem which characterizes when simultaneous bipartite matchings exist. Unfortunately, finding such matchings is NP-hard in general. However, we prove a surprising result that finding a simultaneous matching on a convex bipartite graph takes just polynomial time. Based on this theoretical result, we provide the first polynomial time bound consistency algorithm for the conjunction of two AllDifferent constraints. We identify a pathological problem on which this propagator is exponentially faster compared to existing propagators. Our experiments show that this new propagator can offer significant benefits over existing methods.
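The matching view of a single AllDifferent constraint mentioned in this abstract can be sketched in a few lines of Python. This is only the textbook augmenting-path check for one constraint, not the paper's propagator for a conjunction of two constraints. The function name `alldifferent_solution` and the dictionary-of-domains input format are assumptions made for this illustration.

```python
def alldifferent_solution(domains):
    """Return an assignment variable -> value with pairwise distinct values,
    or None if no matching covers every variable."""
    match = {}  # value -> variable currently assigned to it

    def try_assign(var, seen):
        # Look for an augmenting path starting at `var`.
        for val in domains[var]:
            if val in seen:
                continue
            seen.add(val)
            # Take a free value, or re-route the variable that holds it.
            if val not in match or try_assign(match[val], seen):
                match[val] = var
                return True
        return False

    for var in domains:
        if not try_assign(var, set()):
            return None
    return {var: val for val, var in match.items()}

# x and y compete for value 1, so x is pushed to 2 and z to 3.
print(alldifferent_solution({"x": [1, 2], "y": [1], "z": [2, 3]}))
# Three variables sharing two values violate Hall's condition.
print(alldifferent_solution({"x": [1, 2], "y": [1, 2], "z": [1, 2]}))
```

The second call returns None because the three variables have only two values between them, which is exactly the kind of infeasibility Hall's theorem characterizes.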

artificial intelligence, constraint, constraint-based reasoning, (15 more...)

Twenty-Fourth AAAI Conference on Artificial Intelligence

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > Canada > Quebec (0.04)
North America > Canada > Nova Scotia > Cape Breton County > Sydney (0.04)
Europe > France > Occitanie > Hérault > Montpellier (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
